Selecting Strategies for Infinite-Horizon Dynamic LIMIDS
نویسندگان
چکیده
In previous work we have introduced dynamic limited-memory influence diagrams (DLIMIDs) as an extension of LIMIDs aimed at representing infinite-horizon decision processes. If a DLIMID respects the first-order Markov assumption then it can be represented by 2TLIMIDS. Given that the treatment selection algorithm for LIMIDs, called single policy updating (SPU), can be infeasible even for small finite-horizon models, we propose two alternative algorithms for treatment selection with 2TLIMIDS. First, single rule updating (SRU) is a hill-climbing method inspired upon SPU which needs not iterate exhaustively over all possible policies at each decision node. Second, a simulated annealing algorithm can be used to avoid the local-maximum policies found by SPU and SRU.
منابع مشابه
Solving infinite horizon optimal control problems of nonlinear interconnected large-scale dynamic systems via a Haar wavelet collocation scheme
We consider an approximation scheme using Haar wavelets for solving a class of infinite horizon optimal control problems (OCP's) of nonlinear interconnected large-scale dynamic systems. A computational method based on Haar wavelets in the time-domain is proposed for solving the optimal control problem. Haar wavelets integral operational matrix and direct collocation method are utilized to find ...
متن کاملForecast Horizons for a Class of Dynamic Games
In theory, a Markov perfect equilibrium of an infinite horizon, non-stationary dynamic game requires from players the ability to forecast an infinite amount of data. In this paper, we prove that early strategic decisions are effectively decoupled from the tail game, in non-stationary dynamic games with discounting and uniformly bounded rewards. This decoupling is formalized by the notion of a “...
متن کاملAdmissibility in Infinite Games
We analyse the notion of iterated admissibility, i.e., avoidance of weakly dominated strategies, as a solution concept for extensive games of infinite horizon. This concept is known to provide a valuable criterion for selecting among multiple equilibria and to yield sharp predictions in finite games. However, generalisations to the infinite are inherently problematic, due to unbounded dominance...
متن کاملAsynchronous Games with Transfers: Uniqueness and Optimality in Infinite Horizon∗
This paper and its companion, Dutta-Siconolfi (2016a) proves a First Welfare Theorem for Games. It shows that infinite horizon asynchronous dynamic games with voluntary one period ahead transfers have a unique equilibrium that coincides with the Utilitarian Pareto Optimum and hence can be computed from a (simpler) programming problem (rather than as a fixed point). The only way that multiplicit...
متن کاملConvergence of trajectories in infinite horizon optimization
In this paper, we investigate the convergence of a sequence of minimizing trajectories in infinite horizon optimization problems. The convergence is considered in the sense of ideals and their particular case called the statistical convergence. The optimality is defined as a total cost over the infinite horizon.
متن کامل